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Abstract. We focus on 0-rich and almost (9-rich words over a finite 
alphabet A, where & is an involutive antimorphism over .4*. We show 
that any recurrent almost ©-rich word u is an image of a recurrent &'- 
rich word under a suitable morphism, where &' is again an involutive 
antimorphism. Moreover, if the word u is uniformly recurrent, we show 
that O' can be set to the reversal mapping. We also treat one special 
case of almost 0-rich words. We show that every ©-standard words with 
seed is an image of an Arnoux-Rauzy word. 
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1 Introduction 

In this paper we will deal with infinite words over a finite alphabet A. A word 
u 6 _4. N we are interested in has its language £(u) saturated, in a certain sense, by 
generalized palindromes, here called (9-palindromes. We will use the symbol for 
an involutive antimorphism, i.e., a mapping : A* n> A* such that (9 2 = Id and 
0(uv) = 0(v)0(u) for all u, v 6 A*. Fixed points of are called (9-palindromes. 
A word it; is a (9-palindrome if 0(w) — w. The most common antimorphism used 
in combinatorics on words is the reversal mapping. We will denote it by 0q. The 
reversal mapping associates to every word w — w\W2 ■ ■ ■ w n its mirror image 
0q(w) = w n w n -i . . . w\. In the case w = 0q(w) 7 we will sometimes say that w 
is a palindrome or classical palindrome instead of 6*o-palindromc. 

The set of distinct (9-palindromes occurring in a finite word w will be denoted 
Palg)(ui). Since the empty word e is a (9-palindrome for any 0, we have a simple 
lower bound #Palg>(w) > 1. 

In 2001, Droubay et al. gave in |llj an upper bound for the reversal mapping 
0o- They deduced that #Pale (w) < |w| + 1, where \w\ denotes the length of 
the word w. In [4], Brlek et al. studied involutive antimorphisms with no fixed 
points of length 1. For such they diminished the upper bound, they showed 
that #Pal6>(u;) < \w\ for all non-empty word w. In [13] . the upper bound is 
precised. The following estimate is valid for any involutive antimorphism 0: 



#Pal e H < \w\ + l-j {w), 



(1) 
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where "f (w) := {{a, 0(a)} \ a occurs in w and a ^ 0(a)}. Let us note that if 
O = Oq, then 70 (w) = 0, and the upper bound in (flj is the same bound as for 
usual palindromes. 

According to the terminology for classical palindromes introduced in |12j and 
for ©-palindromes in |13j . we will say that a finite word w is O-rich if the equality 
in (fT|) holds. An infinite word u € -4 N is O-rich if any its factor w £ £(u) is 
(9- rich. In [5], the authors introduced the palindromic defect of a finite word w 
as the difference between the upper bound \w\ + 1 and the actual number of 
palindromic factors. We define analogously the O '-palindromic defect of w as 

D (w) := \w\ + 1 - 7e(w) - #Pal<s>(iu). 

We define for an infinite word u its O -palindromic defect as 

D (u) = sup{D (w) I w e £(u)}. 

Words with finite ©-palindromic defect will be referred to as almost O-rich. 

In [TO] , it is shown that rich words (i.e. ©o-rich words) can be characterized 
using an inequality shown in [5] for infinite words with languages closed under 
reversal. Results of both mentioned papers were generalized in [13] for an arbi- 
trary involutive antimorphism. In particular, it is shown that if an infinite word 
has its language closed under O, the following inequality holds 

C(n + 1) - C(n) + 2 > V (n) + V (n + 1) for all n>l, (2) 

where C(n) is the factor complexity defined by C(n) :— G C(u) \ n = \w\} 
and V (n) is the O -palindromic complexity defined by Vo(n) := =fj={w £ £( u ) 
w = 0(w) and n = \w\}. The gap between the left-hand side and the right- 
hand side in @ decides about (9-richness. Let us therefore denote by T@(n) the 
quantity 

T (n) := C(n + 1) - C(u) + 2-V (n+l) - V (n). 

In [13] . it is also shown that an infinite word with language closed under O is 
(9-rich if and only if 

T (n) = for all n > 1. 

The list of infinite words which are 6*o-rich is quite extensive. See for instance 
[2 7 9. Examples of (9-rich words can be found in [T] . Fewer examples of words 
with finite non-zero palindromic defect are known. Periodic words with finite 
non-zero 6>o-defect can be found in [5] , aperiodic ones are studied in [12] and [3] . 
To our knowledge, examples of words with < D (u) < +00 and O ^ Oq have 
not yet been explicitly exhibited. As we will show, such examples are ©-standard 
words with seed defined in [5] and thus also their subset, standard 6>-episturmian 
words, which can be constructed from standard episturmian words (see [5]). 

The main aim of this paper is to show that among words with finite ©-pa- 
lindromic defect, ©-rich words, i.e. words with D (u) — 0, play an important 
role. We will show the following theorems. 
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Theorem 1. Let 0% : A* M> A* be an involutive antimorphism. Let u £ A N 
be an infinite recurrent word such that De 1 (u) < +oo. Then there exist an 
involutive antimorphism 02 ■ B* B* , a morphism if : B* i— > A* and an 
infinite recurrent word v £ £> N smc/i i/iai 

u = (p(~v) and v is 02-rich. 

A stronger statement can be shown if the requirement of uniform recurrence is 
imposed on the word u. 

Theorem 2. Let O : A* h4 .4* fee an involutive antimorphism. Let u G „4 N be 
an infinite uniformly recurrent word such that Dq(u) < +00. Then there exist 
a morphism (p : B* M> A* and an infinite uniformly recurrent word v £ £> N such 
that 

u = <p(v) and v is 0§-rich. 

One can conclude that rich words, using the classical notion of palindrome, play 
somewhat more important role that (9- rich words for an arbitrary ^ 0q . 

Proofs of the two stated theorems do not provide any relation between the 
size of the alphabet B of the word v and the size of the original alphabet A. In 
the following special case, the size of B can be bounded. Moreover, the word v 
is more specific. 

Theorem 3. Let : A* H> A* be an involutive antimorphism and u £ A N be a 
0-standard word with seed. Then there exist an Arnoux-Rauzy word v £ B N and 
a morphism ip : B* H> A* such that 

u = tp(v) and #B < if A. 

All three mentioned theorems present almost 6*i-rich word as an image of a 
6>2-rich word by a suitable morphism. The opposite question when a morphic 
image of a 6>i-rich word is almost 02-rich is not tackled here. In [12], a type of 
morphisms preserving the set of almost 6>o-rich words is studied. 

2 Properties of words with finite ©-defect 

We will consider mainly infinite words u = (tt n )„gN S A N having their language 
jC(u) closed under a given involutive antimorphism 0. In other words, for any 
factor w £ £(u) we have Q{w) £ £(u). 

For any factor w £ C(u) there exists an index i such that w is a prefix of 
the infinite word UiUi + iiii+2 ■ ■ ■■ Such an index is called an occurrence of w in u. 
If each factor of u has infinitely many occurrences in u, the infinite word u is 
said to be recurrent. It is easy to see that if the language of u is closed under 0, 
then u is recurrent. For a recurrent infinite word u, we may define the notion of 
a complete return word of any w £ C(u). It is a factor v £ C(u) such that w is a 
prefix and a suffix of v and w occurs in v exactly twice. Under a return word of 
a factor w we usually mean a word q £ £(u) such that qw is a complete return 
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word of w. If any factor w £ £(u) has only finitely many return words, then the 
infinite word u is called uniformly recurrent. 

An important role for the description of languages closed under is played 
by the so-called super reduced Rauzy graphs G„(u). Before defining them, we 
will introduce some necessary notions. 

We say that a factor w 6 £(u) is left special (LS) if w has at least two left 
extensions, i.e., if there exist two letters a,b £ A, a ^ b, such that aw, bw 6 £(u). 
A right special (RS) factor is defined analogously. If a factor is LS and RS, we 
refer to it as bispecial. The closedness under assures the following relation: a 
factor w is LS if and only if the factor 0(w) is RS. 

An n-simple path e is a factor of u of length at least n+l such that the only 
special (right or left) factors of length n occurring in e are its prefix and suffix of 
length n. If w is the prefix of e of length n and v is the suffix of e of length n, we 
say that the n-simple path e begins with w and ends with v. We will denote by 
G n (u) an undirected graph whose set of vertices is formed by unordered pairs 
(w,0(w)) such that w G C(u), \w\ — n, is RS or LS. We connect two vertices 
(w, 0(w)) and (v, 0(v)) by an unordered pair (e, 0(e)) if e or 0(e) is an n-simple 
path beginning with w or 0(w) and ending with v or 0(v). Note that the graph 
G„(u) may have multiple edges and loops. 

Surprisingly, the super reduced Rauzy graph G n (u) can be used to detect 
the equality in Let us cite Corollary 7 from |13j . 

Proposition 4. Let n 6 N and £(u) be closed under 0. Then T@(n) — if and 
only if 

1. all n-simple paths forming a loop in G n (u) are -palindromes and 

2. G n (u) after removing loops is a tree. 

Analogously to the case of the reversal mapping, one can see from the defi- 
nition of ©-defect that an infinite word u has finite 0-defect if and only if there 
exists an integer H such that of every prefix p of u of length greater than H 
has a unioccurrent 0-palindromic suffix, i.e., a suffix occurring exactly once in 
p. We will use this fact to prove the following lemma. 

Lemma 5. Let u be a recurrent infinite word with finite Q -defect. Then £(u) 
is closed under 0. 

Proof. Let H be an integer such that every prefix of u of length greater than H 
has a unioccurrent ©-palindromic suffix. Suppose that w is a factor of u such 
that 0(w) £ £(u). Since u is recurrent, we can find two consecutive occurrences 
i and j of the factor w such that i,j>H and i < j. Denote p the prefix of u 
ending with w occurring at j, i.e., \p\ = j + \w\. Since \p\ > H, there exists a 
unioccurrent ©-palindromic suffix of p. Denote s to be such a suffix. If \s\ < \w\, 
then s is a factor of w and thus occurs at least twice in p - a contradiction 
with the unioccurrence of s. If \s\ > \w\, the w is a factor of s which is a 0- 
palindrome and thus contains 0(w) as well - a contradiction with the assumption 
that 0(w) <£ £(u). 
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In [3J, various properties are shown for words with finite 0q -palindromic de- 
fect. These properties and their proofs are valid even if we replace the antimor- 
phism 6>o by an arbitrary 0. Therefore, we mention here the relevant statements 
without proving them. 

Proposition 6. Let u be an infinite recurrent word such that Dq(u) < +oo. 
Then there exists a positive integer H such that 

— every prefix of u longer than H has a unioccurent -palindromic suffix; 

— for any factor w G £(u) such that \w\ > H, occurrences of w and 0(w) in 
the word u alternate; 

— for any w G £(u) such that \w\ > H, every factor v G £(u) beginning 
with w, ending with Q(w), and with no other occurrences of w or 0(w) is a 
-palindrome; 

— T@{n) = for any integer n > H. 

As already mentioned, the first property listed in the previous proposition, in 
fact, characterizes words with finite (9-defect. We do not know whether this is 
the case of the remaining properties. If we restrict our attention to uniformly 
recurrent words, only then several characterizations of words with finite (9-defect 
can be shown. The next proposition states two of them that we will use in what 
follows. Again, the proposition is based on the work done in [3] for = 0g. No 
modifications besides replacing 0q by in its proof are needed, therefore, we 
will omit it. 

Proposition 7. Let u be a uniformly recurrent infinite word with language 
closed under 0. The following statements are equivalent. 

— D@(u) < +oo; 

— there exists a positive integer K such that for any 0-palindrome w € £(u) 
of length \w\ > K, all complete return words of w are -palindromes; 

— there exists a positive integer H such that for any w G £(u), the longest 
-palindromic suffix of w is unioccurrent in w. 

A (9-standard word with seed is an infinite word defined by using ©-palindromic 
closure, for details see [5] . Construction of such word u guarantees that u is uni- 
formly recurrent (cf. Proposition 3.5. in [5]). The authors of [5] showed (Propo- 
sition 4.8) that any complete return word of a sufficiently long ©-palindromic 
factor is a 0-palindrome as well. Therefore, (9-standard words with seed serve 
as an example of almost (9-rich words. 

Corollary 8. Let u be a -standard word with seed. Then D@(u) < +oo. 
3 Proofs 

In this section wegive proofs of all three theorems stated in Introduction. Al- 
though Theorem Q seems to be only a refinement of Theorem [l|, constructions 
of the morphisms if in their proofs differ substantially. It is caused by stronger 
properties we may exploit for a uniformly recurrent word. 
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Proof (Proof of Theorem Qj. Recall that according to Lemma [B| the language 
C(u) is closed under 0\. 

If u is an eventually periodic word with language closed under Q\ , then u is 
purely periodic. Any purely periodic word is a morphic image of a word v over 
one-letter alphabet under the morphism which assigns to this letter the period of 
u. Therefore we may assume without loss of generality that u is not eventually 
periodic. 

Since De 1 (u) < +00, according to Propositions and [f| there exists H € N 
such that 

1. Vw G £(u), |uy I > H, occurrences of w and 0\(w) alternate; 

2. Vw G jC(u), \w\ > H, every factor beginning with w, ending with &\(w) and 
with no other occurrences of w or 0\(w) is a 0\ -palindrome; 

3. Vn > H, every loop in G n (u) is a 6>i-palindrome and G„(u) after removing 
loops is a tree. 

Fix n > H. If an edge (b, 0i(b)) in G„(u) is a loop, then, according to 
the property |3l we have b = 0i(b). If the edge (b,0i(b)) connects two distinct 
vertices (iui, 0i(wi)) and (11)2, @i(u> 2 )), then there exist exactly two n-simple 
paths b and such that WLOG the n-simple path b begins with wi and 

ends with u>2 and the simple path 0±(b) begins with 6*i(ui2) and ends with 
0i{wi). 

We assign to every n-simple path b a new symbol [b], i.e., we define the 
alphabet B as 

B := {[b] I b G C(u) is an n-simple path} 

and on this alphabet we define an involutive antimorphism 0i : B* h4 in the 
following way: 

2 ([6]) := [©!(&)]. 

We are now going to construct a suitable infinite word v G B N . Let (si)jgN 
denote a strictly increasing sequence of indices such that Si is an occurrence of 
RS or LS factor of length n and every RS and LS factor of length n occurs at 
some index s,. We define v = (vi)i^ by the formula 

Vi = [b] if b = u Si u Si+1 ...u Si+1+n -x. 

This construction can be done for any n > H. Since infinitely many prefixes 
of u are LS or RS factors, we can choose such n > H that the prefix of u of 
length n is LS or RS, i.e., sq — 0. 

According to Proposition 12 in [13] . to prove that v is 02-rich we need to 
show the following: 

(i) for every non-empty factor w G £(v), any factor v beginning with w and 
ending with 6*2 (w), with no other occurrences of w or 02{w), is a 02- 
palindrome; 

(ii) for every letter [b] G B such that [b] ^ 02 ([b]), the occurrences of [b] and 
02 {[b]) in the word v alternate. 
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Let us first verify (jl]). Let e and / be factors of v such that e is a prefix 
and 02(e) is a suffix of / and there are no other occurrences of e or 02(e) 
in /. In that case there exist integers r < k such that / = [&i][&2] • ■ • [bk] and 
e = [61] [62] ■ • ■ [br]- The case r = k is trivial. Suppose r < k. Since v is defined 
as a coding of consecutive occurrences of n-simple paths in u, factor / codes a 
certain segment of the word u. Let us denote that segment F — Uj . . . u; where 
j = St for some t £ N and I = St+k—~L + n— 1. Factor e codes in the same way a 
factor E = Uj . . .Uh where h — st+ r -i + n — 1. 

Due to the definition of 02, the fact that e is a prefix of / and 02(e) is a 
suffix of / ensures that E is a prefix of F and 0(E) is a suffix of F. Suppose 
/ is not a 02-palindrome. This implies that F is not a 0i-palindrome which 
contradicts the property [3J 

Let us now verify ((TTJ) . Consider [b] £ B such that [b] ^ 02 ([&])• Moving along 
the infinite word u = uqU\U2 ■ ■ ■ from the left to the right with a window of 
width n corresponds to a walk in the graph G n (u). The pair b and 0\(b) of n- 
simple paths in u represents an edge in G n (u) connecting two distinct vertices. 
Moreover, moving along the n-simple paths b and moving along 0\(b) can be 
viewed as traversing that edge in opposite directions. Since G n (u) after removing 
loops is a tree, the only way to traverse an edge is alternately in one direction 
and in the other. Thus, the occurrences of letters [b] and 02 ( [b]) in v alternate. 

We have shown that v is 02-rich. It is now obvious how to define a morphism 
ip : B* ^ A*. If an n-simple path b equals b = u Si u Si+ i . . . u Si+1 + n -i, then we 
set tp([b]) := u Si u Si+ i . . . 

Proof (Proof of Theorem^). Recall again that according to Lemma the lan- 
guage £(u) is closed under 0. 

Next, we show that infinitely many 0-palindromes are also prefixes of u. 
Consider an integer H whose existence is guaranteed by Proposition[B]and denote 
by w a prefix of u longer than H. Since occurrences of factors w and 0(w) in 
u alternate, according to the same proposition, the prefix of u ending with the 
first occurrence of 0(w) is a 0-palindrome. 

Let us denote by p a 0-palindromic prefix of u with length \p\ > K where 
K is the constant from Proposition [7] All complete return words of p are 0- 
palindromes. Since u is uniformly recurrent, there exist only finite number of 
complete return words to p. Let A 1 ^, A 2 \ . . . , A M ^> be the list of all these return 
words. Any complete return word has the form q^p = r^ for some factor 

, usually called return word of p. Since and p are 0-palindromes, we have 



Let us define a new alphabet B = {1, 2, . . . , M} and morphism <p : B* — > A* 
by the prescription 



p0(q^) = q^p for any return word q 



(3) 



(f(i) = q 



for i = 1,2,..., M. 



First we will check the validity of the relation 




for any w £ B* . 



(4) 
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Let w = i\%2 ■ ■ - i n - Then Oyp{i\i2 ■ ■ ■ i n )p) equals to 

0( P )0(^(i n ))0(^(i„- 1 )) . . . e(^i)) = P e( q ^)e( q ^) . . .e( q W) 

and we may apply gradually n times the equality ([3]) to rewrite the right-hand 
side as 

q M g(tn-i) _ _ _ q M p _ (/ 9(i n )^(i„_ 1 ) . . . Lp{ii)p = tp(0 o (hi2 ■ ■ ■ i n ))p- 

This proves the relation (j4]). 

An important property of the morphism ip is its injectivity. Indeed, in accor- 
dance with the definition, number of occurrences of the factor p in ip(w)p equals 
to the number of letters in w plus one. Moreover, each occurrence of p in (p(w)p 
indicates beginning of an image of a letter under ip. Therefore <p(w)p — <p(v)p 
necessarily implies w = v. 

Let us finally define the word v. As p is a prefix of u, the word u can be 
written as a concatenation of return words and thus we can determine a 
sequence v = (v n ) e B n such that 

u = g M g (»l)g(«l) . . . 

Directly from the definition of v we have u = <p(v). Since u is uniformly recur- 
rent, the word v is uniformly recurrent as well. To prove that v is a 6>o-rich word, 
we will show that any complete return word of any @o-palindrome in the word 
v is a 6>o-palindrome as well. According to Theorem 2.14 in [12], this implies 
the 6>o -richness of v. 

Let s be a 0q -palindrome in v and w its complete return word. Then (p(w)p 
has precisely two occurrences of the factor tp{s)p. Since s is a 6*o-palindrome, we 
have according to @ that (f(s)p is a 0-palindrome of length \tp(s)p\ > \p\ > K. 
Therefore ip(w)p is a complete return word of a long enough <9-palindrome and 
according to our assumption <p(w)p is a (9-palindrome as well. Therefore by using 
(0} we have 

(fi(w)p = 0(ip(w)p) = ip(O {w))p 
and injectivity of <p gives w = &q(w), as we claimed. 

Theorem 6.1 in [5] states that every standard (9-episturmian word is an image 
of a standard episturmian word. Again, the role of Oq can be perceived as more 
important. Also, compared to Theorem 0, it may be seen as a special case since 
0-episturmian words, according to Corollary @, have finite 0-defect. 

Proof (Proof of Theorem^). 

If u is periodic, then the claim is trivial. Suppose u is aperiodic. 

We are going repeat the proof of Theorem [2| with a more specific choice of p. 
Theorem 4.4 in [8] implies that there exists L 6 N such that any LS factor of u 
longer than L is a prefix of u. Without loss of generality, we may assume that 
the constant L is already chosen in such a way that all prefixes of u longer than 
L have the same left extensions. Let us denote their number by M. According 
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to the same theorem, infinitely many prefixes of u are 0-palindromes and thus 
bispecial factors as well. 

According to CorollaryJSl u has finite (9-palmdromic defect. Let K be the 
constant from Proposition [jj. Altogether, there exists a bispecial factor p, \p\ > 
max{L, K}, such that it is a prefix of u and a ©-palindrome. Since p is longer 
than K, all complete return words to p are 0-palindromes. As p is the unique left 
special factor of length \p\ in u, its return words (i.e., complete return words after 
erasing the suffix p) end with distinct letters. It means that there are exactly 
M return words of p, denoted again gW. Let us recall that by M we denoted 
number of left extensions of some factor, therefore M < if A. 

The construction of the the word v and prescription of the morphism <p over 
the alphabet B = _f 1, 2, . . . ,M} can be done in exactly the same way as in the 
proof of Theorem |2j. It remains to show that v is an Arnoux-Rauzy word. 

According to Theorem Q we know that v is 0o-rich and uniformly recurrent. 
Applying Lemma we deduce that the language £(v) is closed under reversal. 

Suppose there exist v,w G £(v), two LS factors such that |u| = \w\ and 
v ^ w. Since the words gW end with distinct letters, it is clear that f(w)p is 
a LS factor of u and it has the same number of left extensions as w. The same 
holds for tp(v)p. Since both these factors have their length greater than or equal 
to \p\ > L and are both LS, one must be prefix of another. Let WLOG <p{w)p 
be a prefix of ip(v)p, i.e., f(v)p — tp(ww')p. The injectivity of ip implies w' = e 
and thus v = w - a contradiction. 

Remark 9. Theorem^ can be seen as a generalization of Theorem 6.1 in [5] to 
©-standard words with seed. 

Remark 10. Note also that the proof of Theorem is in fact a combination of 
methods used in preceding proofs of Theorems [Handi in the sense that the set 
of complete return words r^ 1 ' of the factor p and the set of |p|-simple paths in u 
coincide. 
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